Analysis of protein domain families in Caenorhabditis elegans.
Authors
Abstract
The Caenorhabditis elegans genome sequencing project has completed over half of this nematode's 100-Mb genome. Proteins predicted in the finished sequence have been compiled and released in the database Wormpep. Presented here is a comprehensive analysis of protein domain families in Wormpep 11, which comprises 7299 proteins. The relative abundance of common protein domain families was counted by comparing all Wormpep proteins to the Pfam collection of protein families, which is based on recognition by hidden Markov models. This analysis also identified a number of previously unannotated domains. To investigate new, apparently nematode-specific protein families, Wormpep was clustered into domain families on the basis of sequence similarity using the Domainer program. The largest clusters that lacked clear homology to proteins outside Nematoda were analyzed in further detail, after which some could be assigned a putative function. We compared all proteins in Wormpep 11 to proteins in the human, Saccharomyces cerevisiae, and Haemophilus influenzae genomes. Among the results are the estimate that over two-thirds of the currently known human proteins are likely to have a homologue in the whole C. elegans genome, and the finding that a significant number of proteins are well conserved between C. elegans and H. influenzae but are not found in S. cerevisiae.
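The abundance count described in the abstract can be illustrated with a minimal sketch. It assumes the HMM search results have already been reduced to (protein, family) pairs; the function name `family_abundance`, the protein identifiers, and the example hits are hypothetical and are not taken from the paper, whose actual pipeline is not described here.

```python
from collections import Counter


def family_abundance(hits):
    """Count how many distinct proteins carry each domain family.

    `hits` is an iterable of (protein_id, family_name) pairs, for example
    parsed from the output of an HMM search. A protein with several copies
    of the same domain is counted once for that family.
    """
    unique_pairs = set(hits)  # collapse repeated domains within one protein
    return Counter(family for _protein, family in unique_pairs)


# Hypothetical hits, invented for illustration only.
example_hits = [
    ("protA", "pkinase"),
    ("protA", "pkinase"),  # second copy of the domain in the same protein
    ("protB", "pkinase"),
    ("protB", "EGF"),
    ("protC", "7tm_1"),
]

abundance = family_abundance(example_hits)
for family, n_proteins in abundance.most_common():
    print(family, n_proteins)
```

Counting distinct proteins rather than domain copies is one possible convention; counting every domain occurrence would only require dropping the `set` step.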
Similar resources
HIDDEN MARKOV MODELS AND LARGE-SCALE GENOME ANALYSIS
PFAM is a database of multiple alignments and hidden Markov models (HMMs) of common, conserved protein domains. PFAM HMMs complement BLAST analysis in the annotation of the C. elegans and human genome sequencing projects at Washington University and the Sanger Centre. PFAM2, based on full, gapped multiple alignments of structural and/or functional protein domains, currently contains 527 models....
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on represent...
Use of the interaction of the nematode Caenorhabditis elegans, the fungus Arthrobotrys oligospora, and the bacterium Bacillus subtilis in controlling the nematode Meloidogyne javanica
In this study, the interaction of the nematode Caenorhabditis elegans, the fungus Arthrobotrys oligospora, and the bacterium Bacillus subtilis was used to control the root-knot nematode Meloidogyne javanica. The bacterium was used to stimulate the plant's defense system at the start of treatment and as food for C. elegans, and C. elegans was used to increase trap production. After 72 hours, the fungus A. oligospora caused 77% mortality of M. javanica larvae, ...
Determination of the effects of food preservatives benzoic acid and sodium nitrate on lifespan, fertility and physical growth in Caenorhabditis elegans
Presently, the use of protective food additives such as benzoic acid and sodium nitrate is quite common. However, it was found that these additives, which initially appeared to be harmless, led to the emergence of a number of health problems. Cancer and diseases and deaths with no apparent causes are among the leading concerns. Therefore, the studies which can reveal the genotoxic potential of ...
Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss.
The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd family with 55 genes. Analysis of the structures of these genes indicates that a t...
Journal:
- Genomics
Volume 46, Issue 2
Pages -
Publication date 1997